Distributed Algorithms for Computing Alternate Paths Avoiding Failed Nodes and Links
Authors
Abstract
A recent study characterizing failures in computer networks shows that transient single-element (node or link) failures are the dominant failures in large communication networks such as the Internet. Globally recomputing the routing paths after such a failure therefore does not pay off: the failed element recovers fairly quickly, and the recomputed routing paths must then be discarded. In this paper, we present the first distributed algorithm that computes the alternate paths required by some proactive recovery schemes for handling transient failures. Our algorithm computes paths that avoid a failed node and provides an alternate path to a particular destination from an upstream neighbor of the failed node. With minor modifications, the algorithm can also compute alternate paths that avoid a failed link. To the best of our knowledge, all previous algorithms proposed for computing alternate paths are centralized and require complete information about the network graph as input.
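To make the problem statement concrete, here is a minimal sketch of what "an alternate path from an upstream neighbor that avoids a failed node" means. It is not the paper's distributed algorithm: it is a centralized baseline of the kind the abstract contrasts against, running Dijkstra's algorithm on the full graph with the failed node excluded. The graph encoding, the function names `shortest_path` and `alternate_path`, and the example topology are all assumptions made for illustration.

```python
import heapq


def shortest_path(graph, source, target, banned=frozenset()):
    """Dijkstra's algorithm that ignores every node in `banned`.

    `graph` maps each node to a dict {neighbor: non-negative edge weight}.
    Returns (cost, path) or None if `target` is unreachable.
    """
    if source in banned or target in banned:
        return None
    dist = {source: 0}
    parent = {source: None}
    heap = [(0, source)]
    done = set()
    while heap:
        d, u = heapq.heappop(heap)
        if u in done:
            continue  # stale heap entry
        done.add(u)
        if u == target:
            # Walk parent pointers back to the source.
            path = []
            while u is not None:
                path.append(u)
                u = parent[u]
            return d, path[::-1]
        for v, w in graph.get(u, {}).items():
            if v in banned or v in done:
                continue
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                parent[v] = u
                heapq.heappush(heap, (nd, v))
    return None


def alternate_path(graph, upstream, failed, destination):
    """Centralized baseline (an assumption, not the paper's method):
    shortest path from `upstream` to `destination` that avoids `failed`."""
    return shortest_path(graph, upstream, destination, banned={failed})


# Hypothetical topology: the primary route s -> a -> t goes through node "a".
example_graph = {
    "s": {"a": 1, "b": 4},
    "a": {"s": 1, "t": 1},
    "b": {"s": 4, "t": 1},
    "t": {"a": 1, "b": 1},
}

if __name__ == "__main__":
    print(shortest_path(example_graph, "s", "t"))        # primary route
    print(alternate_path(example_graph, "s", "a", "t"))  # route avoiding "a"
```

Note that this baseline needs the whole graph in one place, which is exactly the requirement the abstract says a distributed algorithm removes.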
Similar resources
Algorithms for Finding Disjoint Paths in Mobile Networks
Disjoint paths are useful in mobile networks for fault tolerance, increasing bandwidth, and achieving better load balance. Communication over multiple disjoint paths is a less expensive alternative to flooding the network. We present a distributed algorithm for finding k disjoint paths, for any given k, between a source S and a destination T in mobile networks where the links have uniform weigh...
Thesis Proposal Efficient Algorithms for Alternate Paths and Bridges in Networks and Geometric Graphs
Transient single element (node/link) failures dominate the failures in today’s large communication networks like the Internet [MIB+04]. Thus, having the routing paths globally recomputed on a failure does not pay off since the failed element recovers fairly quickly, and the recomputed routing paths need to be discarded. A popular technique for dealing with such issues is that of using a proacti...
Efficient Algorithms and Routing Protocols for Handling Transient Single Node Failures
Single node failures represent more than 85% of all node failures in today’s large communication networks such as the Internet [10]. Also, these node failures are usually transient. Consequently, having the routing paths globally recomputed does not pay off since the failed nodes recover fairly quickly, and the recomputed routing paths need to be discarded. Instead, we develop algorithms an...
Optimal redundancy against disjoint vulnerabilities in networks
Redundancy is commonly used to guarantee continued functionality in networked systems. However, often many nodes are vulnerable to the same failure or adversary. A “backup” path is not sufficient if both paths depend on nodes which share a vulnerability. For example, if two nodes of the Internet cannot be connected without using routers belonging to a given untrusted entity, then all of their c...
The Reliability Analysis of Distributed Computing Systems with Imperfect Nodes
The reliability of a distributed computing system depends on the reliability of its communication links and nodes and on the distribution of its resources, such as programs and data files. Many algorithms have been proposed for computing the reliability of distributed computing systems, but they have been applied mostly to distributed computing systems with perfect nodes. However, in real probl...
Journal title:
- CoRR
Volume abs/0811.1301 Issue
Pages -
Publication date 2008